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METHOD OF DETERMINING A GENOTYPE BY COMPARING THE NUCLEOTIDE SEQUENCE OF 
MEMBERS OF A GENE FAMILY AND KIT THEREFOR 

5 The invention relates to a method of determining a genotype by 

comparing members of a gene family, and also to a kit for tracing 
genetic variations in a gene family, 

A gene system is composed of different loci which are in turn 
composed of one or more genes; these genes may contain variations , ao- 

10 called different alleles, for each system. The immunoglobulin supergene 
family, which also includes , inter alia, the T-cell receptor, the immu- 
noglobulin and the HLA (= human leukocyte antigen) gene systems, is 
characterised by the presence of a large variation (polymorphism) . Each 
of the genes which form part of the gene systems is involved in the 

15 immune response in one way or another. Defects in the immune response • 
which ere due to diverse variations in one or more of the gene ays terns 
already mentioned, may result in disease. 

Genetic variation may thus be associated with diseases or diaeaae 
symptoms. Identification of loci/ alleles may therefore, for example, be 

20 of importance in determining the risk of a disease associated with HLA 
or in detecting variations in the gene system which result in a deviat- 
ion, and variations for which the latter is not the case. Thus, the HLA- 
B27 antigen is/are found in 90* of all patients having rheumatoid spon- 
dylitis, and HLA-DR3 or HLA-DK*, or both, is/are found in 80% of 

25 patients afflicted with insulin-dependent diabetes. Anyone having 
HLA-B27 has 180 times as much chance of getting rheumatoid spondylitis 
as a person without HLA-B27* Table I (taken from Biotechnologie , een 
nieuwe industriSle revolutie (Biotechnology, A New Industrial Revolu- 
tion), Antebi, E. and Fishlock. D. , published by Natuur en Techniek, 

30 Maastricht/Brussels (1987) page 91 > shows the most important diseases 
which are known to be associated with the HLA system. These locus and 
allele associations may render clinical practice a great service. They 
are sometimes used for diagnosis and sometimes in the treatment of 
certain diseases. They furthermore open up the possibility of preventi- 

35 on. It is possible to identify those which have the greatest risk of a 
particular disease. That is very important in those situations in which 
preventive or early treatment is possible and the development of the 
disease can be contained. 
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Table I 

The most important diseases associated with the HLA system 



5 HLAs Pathology 



Relative risk 



« 



A1 Hodgkin's disease 

A 3 Idiopathic haenochrosatosls 

B 5 Behcet's disease 

10 Bill. B17 Congenital adrenal hyperplasia 

B27 Rheumatoid spondylitis 

B 35 Subacute thyroiditis 

Cw6 Psoriasis 

DR2 Multiple sclerosis 

15 DR 3 Systemic lupus erythematosus 

DR 3 Addison's disease 

DR 3 Basedow's disease 

DR 3 Myasthenia gravis 

DR 3 Bxtramembranous glomerulopathy 12. 0 

20 DR3 and/or DR4 Insulin-dependent diabetes 6.1, 

DR3 and/or DR7 Coelisc disease i,] 8 

DRl * Rheumatoid arthritis 
Pemphigus 

DR 5 Biermer's anaemia 

2 5 DR5 Hashimoto's disease 

DR5, DRw8 Juvenile arthritis 5.2 



30 



1.* 
8.2 
6.3 
15. 4 
87.4 
13-7 
133 
4.1 
5.8 
6.3 
3.7 
2.5 



4.0 
14.4 
5.4 
5.6 



The relative risk indicates the factor by which the risk of getting 
a disease is multiplied for people having the HLAs concerned. 
Individuals having group HLA-Cw6 have, for example. 13.3 times as 
much chance of getting psoriasis as people not having said antigen. 



Genetic variation in the HLA system is furthermore of importance in 
view of the fact that acceptance in organ and tissue transplants is 
35 associated with the presence of the same HLA alleles in the white blood 
cells, while rejection is related to the presence of various HLA alleles 
in the white blood cells. 

In man the HLA gene family covers a region of 1.500,000 base pairs. 
This region codes for many genes. Thus, this region contains, in additi- 
on to the genes of HLA class I and HI.A class II, inter alia, genes of 



40 
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the complement system f 21 -hydroxylase genes and genes which code for 
tumour necrosis factor. 

The HLA class I region contains 3 "Major" loci: HLA- A, HLA-B and 
HLA-C. The molecules for which said loci code are expressed in all 
5 nucleated cells* The molecules of class 1 are composed of a polymorphous 
o-chain having a molecular weight (MW) of 44.000 D and a constant chain, 
called ^-microglobulin, having a MW of 12,000 D. 

The HLA class XX region contains the loci HLA-DR t HLA-DQ and 
HLA-DP* In addition to said loci, a number of genes are also described 
10 which do not belong to one of the abovementloned loci and are for the 
time being considered to be separate loci (for example, HLA-E; HLA-F; 
HLA-DN; HLA-DZ) * The molecules of class II are composed of an o-chain 
(MW 34,000 D) and a p-chain (MW 29,000 D) . The a-chain of HLA- DR. is 
constant, i.e. no variants have yet been found. HLA-DR contains 
15 3 &-genes, of which two genes code for a product and one gene is a 
pseudogene which is therefore not expressed* Each of the chains can 
form dimeric molecules with the o-chain. The HLA-DQ locus contains four 
(two a- and two P-) genes, of which two are expressed, namely DQa and 
DQP (termed DQA and DOB on the recommendation of the latest nomenclature. 
20 committee) . The molecule of the other genes (DXA and DXB) has not been 
found, although the structure of said genes indicates no characteristic 
pseudogenes. The HLA-DP locus contains four genes (two o and two &).,bf 
which two are expressed. Polymorphism has not yet been demonstrated for 
DPA, and only a few alleles of DPB have been described* The other two DP 
25 genes are pseudogenes. 

A method of detecting specific nucleotide variations and genetic 
polymorphisms which are present in nucleic acids, and also a kit for 
carrying out said method are described in EP-A-237t362. Said method 
comprises amplifying nucleic acid sequences in which the presence of 
30 nucleotide variations, mutations and polymorphisms is assumed, followed 
by the detection thereof with an cl ©specific oligonucleotide sequences 
using a dot blot. In this process, allele-speciflc oligonucleotide 
sequences are used (see page 22 of the said patent application) which 
comprise at least the DNA sequence with the variation to be detected and 
35 have to be specific for the nucleotide variation to be detected. The kit 
therefore comprises many different primers which code for diverse known 
variations in the DNA sequence, to be investigated, of the gene system. 

Immobilised sequence-specific oligonucleotide probes for the gene- 
tic analysis of amplified DNA arc disclosed by the International Patent 
ifO Application W0 89/ 11548 and by R- K. Saiki et al.. Proc. Natl. Acad. 



WO 92/08117 



PCT/US91/07308 



10 



15 



Sci. USA, 86 , 6230-6234 (1989). 

A method of determining genotype* as regards the alleles in the 
HLA-DP loci by hybridising nucleic acid samples with a aeries of probes 
which are apecific for various segments is disclosed by the Internatio- 
nal Patent Application WO 89/115^7, 

A »ethod of distinguishing nucleotide sequences by hybridising 
amplified DNA with linkable probes is disclosed by the International 
Patent Application WO 90/01563. 

These known methods have the following disadvantages: 

a) It is only possible to trace known variations in the DNA sequen- 
ce of a particular gene Bystem, as a result of which any hitherto un- 
known variations in the gene system to be investigated cannot be dis- 
covered by said method. 

b) It is necessary to include various oligonucleotides for every 
known variation in the kit and to use them in the method t with the 
result that, in complex gene systems which comprise many possible 
alleles (such as. for example, the Hi.* system), a different oligonucleo- 
tide has to be used for every possible allele and a different hybridisa- 
tion has to be carried out. which is cumbersome, time-consuming and. 

20 expensive * 

HLA TYPING 
Class I serology 

The HLA class I molecules are determined with the aid of antisera 
25 which are specific for individual alleles- In most cases, alloantisera 
are used because monoclonal antibodies against all the allele forms are 
not available. It is improbable that they will become available in the 
short term because the variation which characterises the polymorphism of 
the various alleles is localised in a small region of the class I gene. 
30 Class I protein 

Protein-chemistry determination of the class I alleles is possible 
by means of immune precipitations with monoclonal antisera against 
constant sections of the molecules, followed by isoelectric focusing 
(IEF) . 
35 Class I DNA 

Typing is possible to a limited extent by means of determinations 
of restriction fragment length polymorphism <RFLP) in which use is made 
of class I cDNA probes* Further refinement has yielded probes which are 
locus-specific, me a result of which clusters of alleles and some 
40 separate alleles can be detected. Typing of class I alleles by means of 
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allele-specific oligohybridi sat ions (ASO) is not considered possible as 
a result of the large number of different alleles and. consequently, 
the large number of oligoe which is necessary. Routine typing: at the DNA 
level is difficult to achieve by this Method. 
5 Class II serology is more difficult to carry out in practice, firstly 
because of the lack of well-defined antisera and secondly because of the 
elaborateness of the Identification technique, viz. two-colour fluores- 
cence . 

Class II proteins can be characterised by Mans of IEF as a result of 
10 the availability of nonoclonal antibodies, but the diversity of alleles 
in tens of variation and number makes the standard typing difficult, 
especially in heterozygous individuals* 
Class II DNA 

RFLP analyses can be used to identify DRA, DQA, DQB and DPB 
15 alleles. In this connection, some alleles are placed in a "supertyplc 
Croup" because they cannot be distinguished individually, but stand out 
as a group of alleles having a specific RFLP pattern from the other 
alleles . 

The technique of Southern blot analyses resulting in RPLP patterns 
20 is readily usable for typing class II. Viewed technically, this approach 
is not suitable for routine class II typing because of its elaborate- 
ness. 

The development of ASO typing with synthetic oligonucleotides on 
amplified DNA by means of the polymerase chain reaction (PGR) has now 
25 made routine typing of DQA possible. Typing will have to be developed 
still further for DRB, DQB and DPB. A disadvantage of this method is 
that a large number of allele-specific oligos has to be used as the 
number of alleles of the locus concerned increases. 
Workshop 

30 Every year the results of the ULA typing centres distributed around 

the world are examined at a workshop, sera being selected and defined as 
reference sera. The improvements in typing by means of protein-chemistry 
determinations. DNA/RF1-P and DNA-ASO analysis is facilitating the stand- 
ardisation of alleles determined with antisera. 

35 As regards protein typing, the workshop determines which monoclonal 

antibodies have to be used and how the data have to be Interpreted. For 
DNA-RFLP typing, the enzyme-probe combination is determined, in addition 
to a standardisation of DNA isolations and hybridisations. 

DNA-ASO typing is offered as o package by Perkin-Elmer/Cetus Corpo- 

40 ration (patented and marketed as a kit) and can now be used for DQA 
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Attempt* are being made to find a combination of the various 
methods which can be carried out uniformly by every typing centre 
throughout the world. In this connection, the reproducibility of the 
5 determinations in various typing centres, the uniformity of reagents 
(sera, probes, oligos) and the possibility of carrying out quality 
controls of the typing play en important role. 

In the abovenentioned cases, the approach to typing iB based on the 
characterisation of the polymorphous sections of the molecules or genes 
10 of the loci. 

It has now been found that use can advantageously be made of 
conserved sequences which flank the polymorphous' system, to determine 
genetic variations. 

Ihe method according to the invention for determining a genotype by 
15 comparing the nucleotide sequence of members of a gene system is there- 
fore characterised in that the nucleotide sequence of a locus or gene of 
a strongly conserved section is compared in which the polymorphism is 
localised the genetic material. 

An important advantage of the method according to the invention is 
20 that deviations in a gene system, and especially deviations which are 
expressed, can be traced more easily and with greater certainty by means 
of it. More particularly, the advantages of the present method are: 

New mutations can be detected directly and can then be investigated 
in more detail for functionality. This is in contrast to serological 
25 typing, in which at most a cross reaction with other sera but more often 
no reaction, is found. ASO typing characterises them as negative for 
that allele. 

The method is independent of known polymorphisms compared with 
other determinations. 
30 - Owing to the simplicity of the system (high degree of reproducibi- 
lity throughout the world), standardisation of typing will be simpli- 
fied. 

The method is independent of geographically determined internal 
variations in blood cells , serum and expression of the alleles. 
35 - At the same time the alleles of different loci can be determined in 
one test, as a result of which the association with diseases becomes 
known in a broader context (more loci or gene systems). 

The nomenclature for the gene systems can be localised in a simple 
manner by definitions of polymorphous parts of the loci/genes by using 
HO the invention. 
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The strongly conserved section of the genetic material whose nucle- 
otide sequence Is compared, according to the invention, has a larger 
homology than the polymorphous (less conserved) sections whose sequence 
is compared in the known methods. The homology at base level. I.e. .the 
5 percentage of nucleotides which is characteristic of a locus or of the 
sllele forms of the genes Is at least approximately 50% and preferably 
at least 5&%. 

The method according to the invention is especially usable for 
determining haplotypes of a gene system. 

10 In particular, the method can be used to compare loci of a gene 

system and alleles of a gene system, such as. the gene system of the 
histocompatibility system. At the same time, it may be a matter of a 
routine determination of alleles in which a specific typing/identifi- 
cation of the presence of an allele is required (for example, in bone 

15 marrow transplants or blood transfusions). 

A gene system which is also advantageously suitable for Investiga- 
tion with the method according to the invention is a gene system of one 
of the members of the immunoglobulin supergene family , in particular the 
genes of an immunoglobulin (Ig) or the genes of the T-cell receptor. 

20 (TCR) . 

The method according to the invention is also suitable for research 
purposes in which a simpler determination of polymorphous sequences 
other than those which are characteristic for typing is possible in a 
gene as a result of selection of other conserved flanking sequences. 
25 The determination of the genotype with the method according to the 

invention relates, in particular, to tissue typing or cell typing. This 
can be done, for example, in order to determine the degree of compatibi- 
lity of transplants. If no, or very few, deviations are found in the 
polymorphous region between donor and acceptor, a transplant will have a 
30 high chance of taking because no rejection takes place on the basis of 
HLAs foreign to the body. Xt is also possible to carry out tissue typing 
or cell typing in order to determine the risk of an HLA-associated 
disease for an animal. A particular aspect in this connection is the 
uniform performebility and introduction of typing possibilities for. for 
35 example, HLA class I alleles which could hitherto be determined only by 
serological techniques • 

Moreover, a gene system of a bacterium, of a virus or of lipo- 
proteins can be used as the gene system whose genetic variation is 
Investigated. 

40 Tho investigation of the possible variation in the nucleotide 
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sequence of a strongly conserved region with the aid of the method 
according to the invention can readily be carried out if the strongly 
conserved section to be investigated is bounded on at least one side, 
and preferably on both aides, by a completely conserved sequence, i.e. a 
5 sequence which is completely . identical in all individuals. Said comple- 
tely conserved sequences have, in particular, a length of at least 5 
nucleotides and. particularly preferably, a length of at least 10 
nucleotides. 

The strongly conserved nucleotide sequences to be investigated can 
10 be amplified using the completely conserved sequences. A method which is 
used with advantage for amplifying the strongly conserved nucleotide 
sequence is a PCR (polymerase chain reaction) (Saiki R.K.. Gelfand D.H.. 
Stoffel S.. Scharf S.J.. Higuchi R. . Hom O.T.. Mullis K.B. and Erlich 
H.A. (1988). Primer directed enzymatic amplification of DNA with a 
15 thermostable DNA polymeraae. Science 222. 487-<»91). Other methods of 
amplifying the sequence to be investigated are also usable in the method 
according to the Invention. 

The sequence amplified in this way can then be aubjected to direct 
sequence analysis, to denaturing gradient gel electrophoresis (DOGE) or 
20 to temperature-gradient gel electrophoresis (TOCE). The extent to which 
the denaturing gel system works depends on the discriminating capacity 
of the gel in relation to the number of different alleles. In other 
words . the gel system will work well if. there is only a limited number 
of alleles of a locus, such as the lipoprotein alleles. HLA-DQA 
25 alleles. HLA-DPB alleles and possibly HLA-DQB alleles, and virus types, 
and can be used to determine expression of different loci in different 
cells or tissues. 

The amplified DNA or RNA can also be analysed (« typed)by direct 
sequence determination methods. For routine typing determinations, both 

30 analyses may be carried out informatively, reproducibly and in a 
controlled manner. For gene systems containing several alleles, direct 
sequence analysis will facilitate the interpretation and therefore the 
reproducibility by the various centres. The direct sequence analysis 
method also has advantages for research purposes. 

35' Determination of modifications has hitherto been carried out at DNA 

level by means of Southern blotting. The different families of TCR and 
Ig genes can be detected by means of specific amplification and sequenc- 
ing or DGGE, while the various members of the family can be distin- 
guished by a set of family-specific primers. Another gene system which 

40 is directly suitable for this approach is the determination of diverse 
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viruses, for example HPV1-16. 

Selection of primers 

Many Alleles of a large number of genes have been sequenced for 
5 research purposes. These sequences are stored in the EMBL databank <for 
Europe) and in the .Gene bank (USA) . both of which are accessible to 
subscribers. On the basis of these sequences, the allele-specific oligos 
(ASO). for example, have been selected from the polymorphous regions and 
these have then been used to identify different alleles. Host gene 
10 systems contain regions or sequences which are not subject to, hardly 
subject to or very strongly subject to mutations. For the HLA class II 
gene system, this has been investigated in detail in order to localise 
the polymorphous regions. References: Oustafsson K. . Emmoth E. , 
Larhammer D. , Bohme J.. Hyldig-Nielsen J.J.. Peterson P.A. and Rask L. 
15 U9M). Nutations and selection in the generation of class II hiatocos- 
pstibility antigen polymorphism. Embo J. J, 1655-1661$ Oustafsson K. . 
Wiaan K. , Larhammer D. , Rask L. and Peterson P.A. (1984). Signal 
sequences distinguish class II histocompatibility antigen beta. chains of 
different loci, Scand. J. Immunol. 12 , 91-97. 
20 The presence of usable sequences for the method according to the 

invention can be detected by more detailed analysis of HLA class I and 
II sequences which were available from the sequence databanks (EMBL and 
Oenebank). supplemented by sel f -determined sequences . Primers for class 

I and other gene, systems have been selected from the sequence databank 
25 and can be used. Primers based on conserved sequences for the HLA class 

II loci ORB. DQA, DOB and DPB can be selected on the basis of the 
sequences already known. Some usable primers are reported in one of the 
examples below. It is within the scope of the person skilled in the art 
to select and to test the completely conserved sequences (primers) sult- 

30 able for the typing he plans. 



Applications 

To determine lipoprotein polymorphism, the ASO can be radioactlvely 
labelled and used as a probe for typing the various alleles which are 

35. present in lipoprotein E. Three frequently occurring alleles can also be 
determined by means of restriction enzyme analysis because, in the case 
of these three alleles in the polymorphous section, the differences are 
localised at restriction enzyne sites. All five different alleles can 
be determined directly by a denaturing gel electrophoresis (Sheffield et 

40 al. , Identifying DNA polymorphisms by denaturing gradient gel electro- 
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Phoresis, in PCR Protocols: A Guide to Methods and- Applications. 
Academic Press Inc.. 1990. pages 206-218) . There are indication, that 
the different alleles can be detected by direct sequencing .ince the 
occurrence of the different alleles i. based on differences in nucleo- 
5 tide sequences (McBride et al. . Cin. Che*. 35. 2196-2201 (1989)). 

In the case of HLA typing, "constant primers" were used to amplify 
DRB genes of a family ln whlch ^ abnormal HLA type occurred. After 
amplification of the DRB genes, cloning in vector, and subsequent 
eequence analysis, it was found that, the "abnormal" HLA type is in fact 
10 a „e« haplotype which cannot be detected by known techniques. In this 
analysis it was found that the identification of the various alleles can 
be carried out .imply b y direct sequencing and localisation of the 
thymidine nucleotides present in the allele (T-tracking) . Automatic 
sequence determinations have already been carried out for the sequencing 
Of HLA-OQA (McBride et al.. 1 9 8 9 . see above). The method followed by 
them can be used directly with slight modifications for automatic 
sequence systems to type the alleles of a locus. 

After the primers have been .elected, "kits" can be assembled for 
the various gene systems, which kits contain, inter alia, .aid primers 
a. a result of which a standardised method is obtained for typing the 
various alleles. In the first instance, the ApoE end HLA-DRB , DQA DOB 
and DPB alleles are investigated for general applicability with 
denaturing gels and by means of sequence analysis. 

The invention therefore also relates to a kit for tracing genetic 
25 variations in a gene family which contains one or two completely 
conserved sequence, having a length of at least 5 nucleotides as 
pri»er{.) and possibly other mens for amplifying and analysing a 
•trongly conserved section of the genetic material to be investigated, 
which section is flanked by the completely conserved nucleotide 
30 aequence(s) . 

EXAMPLE 1 

Primers based on conserved sequences have been selected for the HLA 
class II loci DRB. DQA. DQB and DPB and tested for usability. Shown 
35 below are the sequences of some universal primers which have been selec- 
ted and tested for an amplification reaction (PCR). Two primers are 
preferably used for each reaction. 



20 
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DRB5: 
DRB3: 



5' CC AGO ACQ TIT C 3' 
5 f TT GTR TCT OCA 3' (R 



G/A) 



DQA5: 5* CCA TGA ATT TCA TGG AGA 3 f 

5 DQA3: 5' ATC GTT TAA TCA 3* 

DQA3.1: 5* TGT TCA ACT TRT GTT TT 3* (R * G/A) 

DQA3.2: 5 f ACA GCS ATG TTT STC AGT GCA 3' (S « 0/C) 

N.B. DQA5 is used In combination with DQA3, DQA3-1 or DQA3.2. 

10 DXA3.2: 5 f ACA GCS ATA TTT STC AOT OCA 3 # <S - G/C) 

DXA3: 5 f TCT GCS GOT CAA AAC T 3* 

N.B. DXA3 and DXA3.2 are used in combination with DQA.5. 



DQB5: 5* TGT OCT ACT TCA CCA A 3* 

15 DQB3: 5* GTA GTT GTG TCT GCA y 

DPBJ: 5* GCA GGA ATG CTA CGC GTT TA 3' 

DPB3: 5' CCA GCT CGT AGT TGT OTC TGC 3' 

DPB3SP (alternative for DPB3 ) : 



20 5 1 TTG TAA AAC GAC GGC CAG TCC AGO TCG TAG TTG TGT CTG C 3* 



APOE.F4:5' ACA GAA TTC GCC CCG GCC TGG TAC AC 3* 
APOE.F6:5' TAA GCT TGG CAC GGC TGT CCA AGO A 3' 

The combinations DRB5/DRB3. DQA5/DQA3.2 and DPB5/DPB3SP have been sue- 
25 cessfully used for sequences. 

EXAMPLE 2 

In the family G (two parents, four children) cross reaction was 
found in the determination for HLA class II alleles having different 

30 typing sera by means of the conventional HLA typing, method. It was not 
possible to establish any typing by this means. More detailed analysis 
by means of restriction fragment length polymorphism (RFLP) and ASO 
typing resulted in the hypothesis that it was not two alleles of HLA-DRB 
which occurred in this family in different members of the family, but 

35 three, and this had implications for a disease of one of the members of 
the family to be treated (Tilanus M.G.J. » Van Eggermond M.C.J. A., Pel 
H. t Scheuder G.M.Th. and Giphart M.J. (1989) t Family with an apparent 
HLA-DR triplet: Evidence for exchange of functional HLA-DR beta genes 
between different haplotypes, Exp. Clin. Imnmnogenetlcs 6, 162-168). The 

ftO DNA of this family was studied in more detail in order to be able to 
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establish the genetic basis for this phenomenon. One of the techniques 
which wee „ B ed in doing this consisted of sequence analyis of the 
various alleles of HLA class II loci. 

In this family, three DRB alleles have been identified in the DNA 
5 of one person, viz. DRl, DR2 and D Rw6. on the basis of positive identi- 
fication with allele-specific oligos (triplet). The allele-specific 
oligos detect the corresponding sequences for the regions which also 
differ at protein level. Figure J shows the nucleotide sequence of DRl 
with the derived amino acid sequence in three-letter and one-letter 
10 code. Ihe underlined regions indicate the location of allele-specific 
oligos. The nucleotide sequence and the associated amino acid sequence 
of the allBle-specific oligonucleotides of the other allele, present in 
this family .re shown in Figure. 2a and 2b. Figure 2a shows the nucleo- 
tide variation in a first polyaorphous region and Figure 2b that in a 
second polymorphous region, corresponding to the underlined regions in 
Figure 1. For a .ore detailed analysis of the DRB genes, the universal 
primer. DRB5S and DRB 3 E described were used. In a control study, the 
use of the universal primers for identification of the ORB alleles DRl 
DR2 and DRw6 was tested, but no deviations were found in the.e ..quen- 
ces. The results of the sequence analysis of control cell lines and DNA 
of family o are shown in Figures 3 a and 3 b. respectively. In the for* of 
the amino acid sequence using the one-letter code. 

It is evident from this that there is a recoabined haplotype in 
this family. DR1 and DR2 alleles being involved. The DRw6.wl9 and 
DRw6,52C alleles are identical to the control sequence.. It appears that 
the alleles which are detected by the DRl-specific oligos and by the 
DR2-specific oligo occur next to each other (Figure 4). There is there- 
fore no question of three allele, being present here but of a DRB haplo- 
type not previously identified. In another section of the gene, a speci- 
fic sequence has been identified for this DR1 * 2 haplotype. 

Of other loci (DQB.. DQA and DPB) in this family, it has now been 
established by using universal primers that there are also only two 
(unstated) alleles. In sddition, said haplotype can be characterised by 
the presence of conserved specific sequences. 
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CLAIMS 

1. Method of determining a genotype by comparing the nucleotide 
sequence of members of a gene system, characterised in that the nucleo- 
tide sequence of a strongly conserved section of the genetic material is 

5 compared. 

2. Method according to Claim i, characterised in that the strongly 
conserved section of the genetic material has a homology of at least 
58*. 

3. Method according to Claim 1 or 2. characterised in that haplotypes 
10 of a gene system are determined. 

ft. Method according to Claim 3, characterised in that loci of a gene 
system are compared. 

5. Method according to Claim ft, characterised in that alleles of a 

gene system are compared. 
15 6. Method according to any of Claims 3-5, characterised in that a gene 

system of the histocompatibilty system is used as gene system. 

7- Method according to Claim 1 or 2. characterised in that a gene 

system of one of the members of the immunoglobulin supergene family is 

used as gene system. 
20 8. Method according to Claim 7. characterised in that the gene system 

of an Immunoglobulin is used as gene system. 

9- Method according to Claim 7, characterised in that the gene system 
of the T-cell receptor is used as gene system. 

10. Method according to Claim 1 or 2. characterised in that tissue 
25 typing or cell typing is carried out. 

11. Method according to Claim 10* characterised in that the degree of 
compatibility of transplants or cells is determined. 

12. Method according to Claim 10, characterised in that the risk is 
determined for a person or animal of a disease associated with HLA 

30 (human leucocyte antigen) or MHC (major histocompatibility complex). 

13* Method according to Claim 1 or 2, characterised in that a gene 
system of a bacterium or of a virus is used as gene system. 
1ft. Method according to any of the preceding claims, characterised in 
that the strongly conserved section is flanked by completely conserved 

35 sequences of at least 5 nucleotides. 

15, Method according to Claim lft # characterised in that the completely 
conserved sequences contain at least 10 nucleotides* 

16. Method according to Claim 3ft or 15, characterised in that the 
strongly conserved nucleotide sequence is amplified making use of the 

ftO completely conserved sequences. 
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17. Method according to Claim 16. characterised in that the .trolly 
conserved nucleotide sequence is stifled using a PGR (polymerase chain 
reaction) . 

18. Method according to Claim 16 or 17. characterised i„ that the 
amplified sequence is subjected to direct sequence* analysis . 

19- Method according to Claim 16 or 17. characterised in that the 
amplified sequence is subjected to denaturing gradient gel electro- 
phoresis (DOGE) . 

20. Method according to Claim 16 or 1 7 , characterised in that the 
-Plified sequence is subjected to temperature-gradient gel electro- 
phoresis (TOOE). 

21. Kit for tracing genetic variation. i„ s gene family which contains 
one or two completely conserved nucleotide sequences having a length of 
at least 5 nucleotides as primer(s> and possibly other means for ampli- 
fying and analysing a strongly conserved section of the genetic material 
to be investigated, which section is flanked by the completely conserved 
nucleotide sequence(s). 
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fig. 1 

DR1 

32 62 
CGTTTCTTGTOaCAGCTTAACTTTCAATCTCATTTCTTCAATCaGACGOAOCGCGTaCGO 
ArgPheLeuTrpGlnLeuLyePheGluCysHiBPhePheAsnGlyThrGluArgValAr^ 
* ' L W. Q L K PBCBFPNGTBRVR 

92 122 
TTGCTGGAAAGATGCATCTATAACCAAGAGGAGTCCGTGCGCTTCGACAGCGACGTGGGG 
LeuL©uGluArgCyBlleTyrA8nGlnaiuGluSerValArffPheAspSerA»pValGly 
L L E R C I V N QE ES VRFDSDVG 

152 182 
GAGTACCGGGCGGTGACGGAGCTGGGGCGGCCTGATGCCGAGTACTGGAAACGGCAGAAG 
GIuTyrArgAlaValThrGluLeuGlyArgProA«pAlaGluTyrTrpAanSerGlnLy« 
EY RAVTELGR PDAEyWNSQK 

212 Zk2 
GACCTCCTGGAGCAGAGGCGGGCCGCGGTGGACACCTACTGCAGACACAACTACGGGGTT 
AspL«uLeuGluGlnArgArgAlaAlaValA«pThrryrCysApgHisAsnTyrGlyVal 
D L LEQRRAAVDTYCRHNYGV 



GGTGAQAGCTTCACAGTGCAGCGGCGAA 
GlyGluSerPheThrValGlnArgArg 
GESFTV QRR 
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Fig. 2a 

DRl 

COT TTC TTG TOO CAG CTT AAG TTT GAA TOT C 
RFL WQLKFEC 

DR2I1 

CGT TTC CTG TOG CAG CCT AAG AGO GAG TOT C 
RPLWQPKREC 

CGT TTC TTG CAG CAG GAT AAG TAT GAG TOT C 
BFLQQDKYEC 

DRw6.wl9 

CGT TTC TTG GAG TAC TCT ACG TCT GAG TGT C 
RFLEYSTSEC 

DRw6.52 c 

CGT TTC TTG GAG CTG CTT AAG TCT GAG TGT C 
RFLELLKSEC 
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Fig, 2b 

DR1 

TTG CTG OAA AGA TGC ATC TAT AAC CAA GAG * 
LLER CIYN QE 

TTC CTG GAC AGA TAC TTC TAT AAC CAG GAG 
FL0RY FYKQE 

TTC CTG CAC AGA GAC ATC TAT AAC CAA GAG 
FLHRD 2YN QE 

DRw6.wl9 

TTC CTG GAC AGA TAC TTC CAT AAC CAG GAG 
FLDRYFHN QE 

DRw6.52 c 

TTC CTG GAC AGA TAC TTC CAT AAC CAG GAG 
FLDRYFRNQE 
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Fig. 3a Control ONA 



DR1 RPLWQLKFECHPFNGTERVRLLERCIYNQEESVRFD 

DR2H p. R F-D-YF 

DR2, 5 Q-D-Y F-H-D DL- — 

DRw6 . wl9 EYSTS F-D-YFH N 

DRW6.52C — -EL— S F-D-YFH F 

Fig. 5b DNA family G 



DR1 _ 

DR1 - like F-H-D DL- 

DR2-like — E-A-C— I F-H-D DL- 

DRw6.wl9 EYSTS F-D-YFH N-- 

DRw6.52 c EL- -S F-D-YFH F— 



- « sane aiino acid as in DR1 at that position. 
» - regions for ASO typing 
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Fig. 4 

Control DNA 





DR1 


« 




DR1 








DR2 


0R2*1 


• 


DR2f5 










DRw6 


DRw6.wl9 


• 


ORw6.52 c 











DNA "triplet" 

DR1 • »R2»5 
DR1 ♦ 2 1 | |,_ 



DRw6.«19 • DRw6.52 c 
DRw6 1 1 ■ 



I specific sequence for DR1 and 2 

pseudogene 
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